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Abstract — This paper addresses the problem of sparse recovery 
with graph constraints in the sense that we can take additive 
measurements over nodes only if they induce a connected sub- 
graph. We provide explicit measurement constructions for several 
special graphs. A general measurement construction algorithm 
is also proposed and evaluated. For any given graph G with n 
nodes, we derive order optimal upper bounds of the minimum 
number of measurements needed to recover any fe-sparse vector 
over G (M^ n ). Our study suggests that M^ n may serve as a 
graph connectivity metric. 

I. Introduction 

Network monitoring is an important module in the operation 
and management of communication networks, where network 
performance characteristics, such as traffic transmission rates 
and router queueing delays, should be monitored. Since moni- 
toring each object in the network directly can be operationally 
difficult or even infeasible, the topic of inferring internal 
characteristics using information from indirect end-to-end (ag- 
gregate) measurements, known as Network Tomography, has 
been widely explored recently H71. ifTOl. Ifl2l. lEOl. Il22l. l25l. 

go. 

In practice, the total number of aggregate measurements 
we can take is small compared with the size of the network. 
However, we can indeed extract the most dominating ele- 
ments of a high-dimensional signal from low-dimensional non- 
adaptive measurements. With the signal itself being sparse, 
i.e. most entries are zero, the recovered signal can be exact 
even though the number of measurements is much smaller 
than the dimension of the signal. One practical example is that 
only a small number of bottleneck links in the communication 
networks experience large delays. Sparse Recovery addresses 
the problem of recovering sparse high-dimensional signals 
from low-dimensional measurements, and has two different 
but closely related problem formulations. One is Compressed 
Sensing O, |9l, H3, flOl, ED, where the signal is rep- 
resented by a high-dimensional real vector, and an aggregate 
measurement is the arithmetical sum of the corresponding real 
entties. The other is Group Testing Ifl8l , |[T9l , where the high- 
dimensional vector is logical, and a measurement is a logical 
disjunction (OR) on the corresponding logical values. 

One key question in both compressed sensing and group 
testing is to design a small number of non-adaptive measure- 
ments (either real or logical) such that all the vectors (either 
real or logical) up to certain sparsity (the support size of 
a vector) can be correctly recovered. Most existing results, 
however, rely critically on the assumption that any subset 



of the values can be aggregated together [8|, |16|, which is 
not realistic in the network monitoring problem. Here only 
objects that can form a path or a cycle on the graph ll22l . or 
induce a connected subgraph can be combined together in the 
same measurement. Only a few recent works consider graph 
topological constraints in compressed sensing lfT3l . ETI . [24|, 
ED, 02 and group testing 0, JED, ED, (26], l|29l. 

Though motivated by the network monitoring application, 
beyond networks. Indeed, this formulation abstractly models 
that certain elements cannot be measured together in a com- 
plex system. Thus, our work can be useful to other applications 
besides network tomography. 

Here are the main contributions of this paper. 

(1) We provide explicit measurement constructions for dif- 
ferent graphs. Moveover, the number of our measurements 
improves over the existing estimates (e.g. ifTTl . OTl ) of the 
minimum number of measurements required to recover sparse 
vectors over graphs. (Section HIH> 

(2) We propose a design guideline based on r-partition for 
general graphs and further show some of its properties. (Sec- 
tion ISA) 

(3) A simple measurement design algorithm is proposed for 
general graphs. (Section HV-Bt We evaluate its performance 
both theoretically and numerically. (Section [V} 

We now start with Section [TT] to introduce the model and 
problem formulation. 

II. Model and Problem Formulation 

Consider a graph G = (V, E), where V denotes the set 
of nodes with cardinality |V| = n and E denotes the set of 
links. Each node i is associated with a real number Xi, and 
we say vector x = (xi,i = 1, n) is associated with G. Let 
T = {i | Xi 7^ 0} denote the support of x, and let ||x||o = |T| 
denote the number of non-zero entries of x, we say x is a 
fc-sparse vector if ||x||o = k. 

Let S C V denote a subset of nodes in G. Let Es denote 
the subset of links with both ends in S, then Gs = (S, Es) 
is the induced subgraph of G. We have the following two 
assumptions throughout the paper: 

(Al): A set S of nodes can be measured together in one 
measurement if and only if Gs is connected. 
(A2): The measurement is an additive sum of values at the 
corresponding nodes. 

(Al) captures the graph consttaints. One practical example 
is a sensor network where the nodes represent sensors and the 
links represent feasible communication between sensors. For 



the set S of nodes that induce a connected subgraph, one node 
u in S monitors the total values corresponding to nodes in S. 
Every node in S obtains values from its children, if any, on 
the spanning tree rooted at u, aggregates them with its own 
value and sends the sum to its parent. Then the fusion center 
can obtain the sum of values corresponding to all the nodes 
in S by only communicating with u. (A2) follows from the 
additive property of many network characteristics, e.g. delays 
and packet loss rates ll22l . However, compressed sensing can 
also be applied to cases where (A2) does not hold, e.g., the 
measurements can be nonlinear as in |5), ||27ll . 

Let y £ lZ m (m <C n) denote the vector of m measure- 
ments. Let A be an m x n measurement matrix with Aij = 1 
(i = l,...,m, j = l,...,n) if and only if node j is included 
in the ith measurement and Aij = otherwise. Then we have 
y = Ax.. We say A can identify all fc-sparse vectors if and only 
if Ax.1 7^ Ax.2 for every two different vectors Xi and X2 that 
are at most fc-sparse. The advantage of sparse recovery is that 
with the non-adaptive measurement matrix A, it can identify 
n-dimensional vectors from m (m <C n) measurements as 
long as the vectors are sparse. 



TABLE I 
SUMMARY OF KEY NOTATIONS 




Fig. 1. Network Example 

With the above assumptions, A is a 0-1 matrix and for each 
row of A, the set of nodes that correspond to ' 1' should form 
a connected induced subgraph of G. In Fig.[U we can measure 
nodes in Si and 52 separately, and the measurement matrix is 
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We remark here that in group testing with graph constraints, 
the requirements for the measurement matrix A are the same, 
while group testing differs from compressed sensing only in 
that (1) x is a logical vector, and (2) the operation used in each 
group testing measurement is the logical "OR". All arguments 
and results in this paper are in the compressed sensing setup if 
not otherwise specified, and we also compare our results with 
group testing for special networks. Note that for recovering 
1 -sparse vectors, the numbers of measurements required by 
compressed sensing and group testing are the same. 

Given a graph G with n nodes, let denote the mini- 

mum number of non-adaptive measurements needed to identify 
all fc-sparse vectors associated with G. Let M^ n denote the 
minimum number of non-adaptive measurements needed in a 
complete graph with n nodes. In complete graphs, since any 
subset of nodes can be measured together, any 0-1 matrix is 
a feasible measurement matrix. Existing results (4), J9], [30] 
show that with overwhelming probability a random 0-1 matrix 



Notation 


Meaning 


G S 


Subgraph of G induced by S 




Minimum number of measurements needed to recover fc-sparse 
vectors associated with G of n nodes. 


fc ,77, 


Minimum number of measurements needed to recover fc-sparse 
vectors associated with a complete graph of n nodes. 


f{k,n) 


Number of measurements constructed to recover fc-sparse 
vectors associated with a complete graph of n nodes 



with 0(k\og{n/k)) row£] can identify all fc-sparse vectors, 
and we can recover the sparse vector by £\ -minimization, 
which returns the vector with the least ^i-nor among those 
that can produce the obtained measurements. Then we have 

M^ n = 0(klog(n/k)). (1) 

We will use (Q~|i for the analysis of construction methods. 
Explicit constructions of measurement matrices for complete 
graphs also exist, e.g., HI, 0, |E), El, ED. We will use 
/(fc, n) to denote the number of measurements to recover fc- 
sparse vectors associated with the complete graph of n nodes 
by a particular measurement construction method, and /(fc, n) 
varies for different construction methods. The key notations 
are summarized in Table HI 

The questions we would like to address in the paper are: 

• Given graph G, what is the corresponding ? 

• How to explicitly design measurements such that the total 
number of measurements is close to MP ? 

III. Sparse Recovery over Special Graphs 

In this section, we consider four kinds of special graphs: 
one-dimensional line/ring network, ring with each node con- 
necting to four closest neighbors, two-dimensional grid and 
a tree. We construct measurements for each graph and later 
generalize the construction ideas obtained here to general 
graphs in Section HVl 

A. Line and Ring 

First consider one-dimensional line/ring network as shown 
in Fig. [2] When later comparing the results here with those in 
Section IIII-BI one can see that the number of measurements 
required to recover sparse vectors can be significantly different 
in two graphs that only differ from each other with a small 
number of links. 

In a line/ring network, there is not much freedom in the 
measurement design since only consecutive nodes can be mea- 
sured together from assumption (Al). In fact, 11231 . [26| show 
that r 2 ^! (or \^~\) measurements are both necessary and 
sufficient to recover 1 -sparse vectors associated with a line (or 
ring) network with n nodes. Therefore, 8(n) measurements 

'We use the notations g(n) £ 0(h(n)), g(n) S Q(h(n)), or g(n) = 
@(h(n)) if as n goes to infinity, g(n) < ch(n), g(n) > ch(n) or c\h(n) < 
g(n) < C2h(n) eventually holds for some positive constants c, c\ and C2 
respectively. 

2 The £p-norm (p > 1) of x is ||x|| p = (y] ■ |2j| p ) 1 / p , and ||x||oo = 
maxi |aii|. 




(b) 



Fig. 2. (a) line network (b) ring network 



are required to recover even one non-zero element associated 
with a line/ring network. 

We next construct fefj^j] + 1 measurements to recover k- 
sparse vectors (fc > 2) associated with the line/ring network. 
Let t = ■ F° r every 1 < i < kt+1, the ith measurement 

goes through all the nodes from i to min(i + 1 — 1, n). 

Theorem 1. k\j^~\ + 1 above measurements are sufficient 
to identify all k-sparse vectors associated with a line/ring 
network with n nodes. 

Proof: Consider matrix J 4( tfe + 1 ) x (* fc +*) with its zth row 
having Ts from entry i to entry i+t—1 and 'O's elsewhere for 
all 1 < i < tk + 1. Then the first n columns of A correspond 
to our measurement matrix. To prove the statement, we only 
need to show that A can identify all fc-sparse vectors in lZ tk+t , 
which happens if and only if every non-zero vector z such that 
Az = holds has at least 2k + 1 non-zero elements |8). 

For each index 1 < k' < k, define a submatrix Ay, which 
consists of the first tk' + 1 rows and the first tk' +t columns of 
A. We claim that every non-zero vector w such that A^-w = 
holds has at least 2k' + 1 non-zero elements with at least two 
non-zero elements in the last t entries. We prove this claim 
by induction over k' . 

First consider A\. Note that its first row has Ts from 
column 1 to t, and its last row has Ts from column t + 1 to 
2t. Because any two columns of the submatrix A\ are linearly 
independent, for any w ^ such that A±w = 0, w must have 
at least three non-zero elements. Let j be the index of the last 
non-zero element of w. If j < t, consider the jth row of A\ 
with its first ' 1 ' entry in the jth column. The inner product of 
the jth row and w is non-zero, contradicting the assumption 
that Aiw = 0. Then j > t + 1 must hold. Then since the 
inner product between w and the last row of A\ is zero, at 
least two non-zero elements exist in the last t entries of w. 

Now suppose the claim holds for A^, consider a non-zero 
vector w such that A^+iW = holds. Note that the vector 
of the first tk' + t positions of w, denoted by w, satisfies 
Ak'W = 0. We remark that w ^ 0. If w = 0, let j denote 
the index of the first non-zero element of w, and we have 
j > tk' + t + l. Consider the ( j + 1 - t)th row of Ay+i with 
its last T entry in column j. Then the inner product of this 
row with w is non-zero, which is a contradiction. 

Since w/0, from the induction assumption, it has at least 
2k' + 1 non-zero elements with at least two non-zero elements 
in its last t elements. Now consider the last 2t elements of 
w and the last t + 1 measurements in Ak>+i- From a similar 
argument for the case of A\, we know that w must have at 



least two non-zero elements in the last t positions. So w has 
at least 2(k' + 1) + 1 non-zero elements. 

By induction over k', every w/0 satisfying Aw = has 
at least 2k + 1 non-zero entries. This completes the proof. ■ 

Theorem Q] implies that we can save about LfptJ mea- 
surements but still be able to recover fc-sparse vectors in a 
line/ring network via compressed sensing. However, for group 
testing associated with a line/ring network, one can check 
that n measurements are necessary to recover more than one 
non-zero element. The key is that every node should be the 
endpoint at least twice, where the endpoints are the nodes at 
the beginning and the end of a measurement. The endpoints of 
a measurement can be a same node. If node u is an endpoint 
for at most once, then it is always measured together with 
one of its neighbors, say v, if ever measured. Then when v 
is T, we cannot determine the value of u, either T or '0'. 
Therefore, to recover more than one non-zero element, we 
need at least 2n endpoints, and thus n measurements. 

B. Ring with nodes connecting to four closest neighbors 

We know from Section UlI-AI that \n/2~\ measurements are 
necessary to recover even one non-zero element associated 
with a ring network. Now consider a graph with each node 
directly connecting to its four closest neighbors as in Fig. [3] 
(a), denoted by 4 . Q A is important to the study of small-world 
networks [28|. Q A has n more links than the ring network, 
but we will show that the number of measurements required 
by compressed sensing to recover fc-sparse vectors associated 
with Q A significantly reduces from Q(n) to 0(k\og(n/k)). 

Throughout the paper, given a graph G = (V, E), we say 
S forms a hub for U if Gs is connected, and for every u 
in U, there exists s in S such that (u, s) E E. Clearly the 
set of all the odd nodes, denoted by T Q , form a hub for the 
set of all the even nodes, denoted by T e . Given a fc-sparse 
vector x, let x D and x e denote the subvectors of x with odd 
and even indices. Then x D and x e are at most fc-sparse. The 
sum of entries in x , denoted by s , can be obtained by one 
measurement, and similarly for the sum s e of the entries of 
x e . For any subset W of T e , T a U W induces a connected 
subgraph and thus can be measured by one measurement. We 
can obtain the sum of values corresponding to nodes in W by 
measuring nodes in T Q U W and then subtracting s from the 
sum. For example in Fig. [3] (b) and (c), in order to measure 
the sum of the pink nodes 2, 8 and 10, we measure the sum of 
pink nodes and all the black odd nodes, and then subtract s 
from the obtained summation. Though the subgraph induced 
by T e are not complete, we can indeed freely measure nodes 
in T e with the help of the hub T a . Therefore M^ n/2 ^ + 1 
measurements are enough to recover x e € 7\!.L"/ 2 J 5 where the 
additional one measurement measures s . Similarly, we can 
use T e as a hub to recover the subvector x e T^T"/ 2 ! with 
Mj? [n/2l + 1 measurements, and thus x is recovered. From 
above, we have 

Theorem 2. All k-sparse vectors associated with Q A can be 
recovered with Mp ', ,„ , + Mp' r , nl + 2 measurements, which 

fe, [71/ 2 \ k.\n/2\ ' 




is 0(2felog(n/(2fc)))+2. 

Theorem |2] is important in the following three aspects. 

Firstly, from ring network to Q 4 , although the number 
of links only increases by n, the number of measurements 
required to recover fc-sparse vectors significantly reduces from 
6(n) to (O(2felog(n/(2fe))) + 2. Besides, this value is in the 
same order as M^' n , while the number of links in Q 4 is only 
2n compared with n(n — l)/2 links in a complete graph. 

Secondly, the idea of using a hub to design the measure- 
ments is very important for our later results. If set S can 
serve as a hub for U in graph G, then the induced graph Gjj 
is "almost equivalent" to a complete subgraph in the sense 
that we can measure any subset of nodes in U freely via 
S. The number of measurements required to recover fc-sparse 
vectors associated with U is M^, v , + 1 with one additional 
one measurement for the hub. 

Thirdly, our estimate 0(2fclog(n/(2fe))) + 2 on the min- 
imum number of measurements required to recover fc-sparse 
vectors greatly improves over the existing results in ifTTl . |[3~Tll . 
both of which are based on the mixing time of a random 
walk. The mixing time T(n) is the smallest t' such that a 
random walk of length t' starting at any node in G ends up 
having a distribution fj,' with \\fi — m'IIoo < l/(2cn) 2 for some 
c > 1, where fi is the stationary distribution over the nodes 
of a standard random walk over the graph G. BTl proves that 
0(kT 2 (n) logn) measurements can identify fc-sparse vectors 
with overwhelming probability by compressed sensing. IfTTl 
uses 0(k 2 T 2 (n) log(n/fc)) measurements to identify fc non- 
zero elements by group testing. In Q 4 , one can easily see that 
T(n) should be at least n/4. Then both results provide no 
saving in the number of measurements for Q 4 as the mixing 
time is 8(n). 

Besides the explicit measurement construction described 
before Theorem we can also recover fc-sparse vectors with 
O(logn) random measurements with high probability. We 
need to point out that these random measurements do not 
depend on the measurements of a complete graph. 

Consider an n-step Markov chain {Xk,l < k < n} with 
X\ = 1. For any k < n — 1, if Xk = 0, then Xk+i = 1; 
if Xk = 1, then Xk+i can be or 1 with equal probability. 
Clearly any realization of this Markov chain does not contain 
two or more consecutive zeros, and thus is a feasible row of 
the measurement matrix. Moreover, 



Theorem 3. With high probability all k-sparse vectors asso- 
ciated with Q 4 can be recovered with 0(g(k) logn) measure- 
ments obtained from the above Markov chain, where g{k) is 
a function of fc. 

Proof: See Appendix. ■ 
Adding n links in the form + 2 (mod n)) to the ring 
network greatly reduces the number of measurements needed 
from <d(n) to O(logn). Then how many links in the form 
+ 2 (mod n)) shall we add to the ring network such that 
the minimum number of measurements required to recover fc- 
sparse vectors is exactly 9 (log n)? The answer is n— 6 (log n). 
To see this, let denote the graph obtained by deleting h 
links in the form + 2 (mod n)) from Q 4 . For example in 
Fig. [3(d), we delete links (3,5), (8,10) and (9,11) in red 
dashed lines from Q 4 . Given h, our following results do not 
depend on the specific choice of links to remove. We have 

Theorem 4. The minimum number of measurements required 
to recover k-sparse vectors associated with is lower 
bounded by \h/2], and upper bounded by 2M^„^ + h + 2. 

Proof: Let D denote the set of nodes such that for every 
i € D, link (i — 1, i+ 1) is removed from Q 4 . The proof of the 
lower bound follows the proof of Theorem 2 in l26l . The key 
idea is that recovering one non-zero element in D is equivalent 
to recovering one non-zero element in a ring network with h 
nodes, and thus \h/2] measurements are necessary. 

For the upper bound, we first measure nodes in D separately 
with h measurements. Let S contain the even nodes in D 
and all the odd nodes. S can be used as a hub to recover 
the fc-sparse subvectors associated with the even nodes that 
are not in D, and the number of measurements used is at 
most yni + 1. We similarly recover fc-sparse subvectors 
associated with odd nodes that are not in D using the set of 
the odd nodes in D and all the even nodes as a hub. The 
number of measurements is at most M^r„-, + 1. Sum them 
up and the upper bound follows. ■ 

Together with (Q~|i, Theorem [4] directly implies that if 
Q(\ogn) links in the form (i, i + 2(mod n)) are deleted 
from Q 4 , then 9 (logn) measurements are both necessary and 
sufficient to recover fc-sparse vectors associated with £?0n O g„^ 
for any constant fc. Moreover, the lower bound in Theorem [4] 
implies that if the number of links removed is f2(logn), then 
the number of measurements required for sparse recovery is 





(a) The set of black nodes as a hub (b) Measure pink nodes via the hub 
Fig. 4. Sparse recovery on two-dimensional grid 



also fi(logn). Thus, we need to add n — 8(logn) links to a 
ring network such that the number of measurements required 
for sparse recovery is exactly O(logn). 

Since the number of measurements required by compressed 
sensing is greatly reduced when we add n links to the ring 
network, one may wonder whether the number of measure- 
ments needed to locate k non-zero elements by group testing 
can also be greatly reduced or not. Our next result shows that 
this is not the case for group testing. 

Proposition 1. [ n /4j measurements are necessary to locate 
two non-zero elements associated with Q 4 by group testing. 

Proof: Suppose two non-zero elements are on nodes 2i — 1 
and 2i for some 1 < i < J . We view nodes 2i — 1 and 2i 
as a group for every i (1 < i < [^\), denoted by Bi. If both 
nodes in Bj are Ts for some j, then every measurement that 
passes either node or both nodes in Bi is always ' 1 ' . Consider 
a reduced graph with Bi, Vi as nodes, and link (Bi, Bj) (i ^ j) 
exists only if in Q 4 there is a path from a node in Bi to a node 
in Bj without going though any other node not in Bi or Bj. 
The reduced network is a ring with [§J nodes, and thus [n/4\ 
measurements are required to locate one non-zero element in 
the reduced network. Then the lower bound follows. ■ 
By Theorem [2] and Proposition Q] we observe that in Q 4 , 
with compressed sensing the number of measurements needed 
to recover fc-sparse vectors is 0(2k log(n/(2fc))), while with 
group testing, O(n) measurements are required if k > 2. 

C. Two-dimensional grid 

Next we consider the two-dimensional grid, denoted by Q 2d . 
Q 2d has y/n rows and y/n columns. From now on we skip ' [•] ' 
and ' |_-J ' for notational simplicity, but note that the number of 
nodes should always be an integer. 

We assume ^fn to be even here for notational simplicity, 
and the result can be easily modified for the case that y/n is 
odd. The idea of measurement construction is similar to that 
for graph Q 4 . First, Let S\ contain the nodes in the first row 
and all the nodes in the odd columns. Then S\ can be used as 
a hub to measure fc-sparse subvectors associated with nodes 
in V\S\, as shown in Fig. [4] The number of measurements 
is M? ' , ,„ + 1. Then let So contain the nodes in the 

first row and all the nodes in the even columns, and use S2 
as a hub to recover up to fc-sparse subvectors associated with 
nodes in V^S^- Then number of measurements required is also 



Mr, ,„ r ,„ + 1. Finally, use nodes in the second row as a 
hub to recover sparse subvectors associated with nodes in the 
first row. Since nodes in the second row are already identified 
in the above two steps, then we do not need to measure the 
hub separately in this step. The number of measurements here 

is MP ^. Therefore, 

k, v n 

Theorem 5. The number of measurements needed to re- 
cover k-sparse vectors associated with Q 2d is at most 



2M? 



2. 



D. Tree 

Next we consider a tree topology as in Fig. [5] For a given 
tree, the root is treated as the only node in layer 0. The nodes 
that are t steps away from the root are in layer t. We say the 
tree has depth h if the farthest node is h steps away from the 
root. Let rii denote the number of nodes on layer i, and tiq = 1. 
We construct measurements to recover vectors associated with 
a tree by the following tree approach. 




Fig. 5. Tree topology 

We recover the nodes layer by layer starting from the root, 
and recovering nodes in layer i requires that all the nodes 
above layer i should already be recovered. First measure the 
root separately. When recovering the subvector associated with 
nodes in layer i (2 < i < h), we can measure the sum of any 
subset of nodes in layer i using some nodes in the upper layers 
as hub and then delete the value of the hub from the obtained 
sum. One simple way to find a hub is to trace back from nodes 
to be measured on the tree simultaneously until they reach one 
same node. For example in Fig. [5] in order to measure nodes 
5 and 7 together, we will trace back to the root and measure 
nodes 1, 2, 3, 5, and 7 together and then subtract the values 
of nodes 1, 2, and 3, which are already identified when we 
recover nodes in the upper layers. With this approach, we have, 

Theorem 6. X^Lo-^/Pn measurements are enough to recover 
k-sparse vectors associated with a tree with depth h, where 
rii is the number of nodes in layer i. 

IV. Sparse Recovery over General Graphs 

In this section we consider recovering fc-sparse vectors over 
general graphs. The graph is assumed to be connected. If not, 
we simply treat each component as a connected subgraph and 
design measurements to recover fc-sparse subvectors associated 
with each subgraph separately. 

Inspired by the construction methods in Section [III] in 
Section IIV-AI we propose a general design guideline based 
on "r-partition" which will be introduced soon. The key idea 



is to divide the nodes into a small number of groups such 
that nodes in the same group are connected to one hub, and 
thus can be measured freely with the help of the hub. We 
use the Erdos-Renyi random graph as an example to illustrate 
the design guideline based on r-partition. Since finding the 
minimum number of such groups in general turns out to be 
NP-hard, in Section IIV-BI we propose a simple algorithm to 
design a small number of measurements to recover fc-sparse 
vectors associated with any given graph. 

A. Measurement Construction Based on r-partition 

In Q 4 , we divide nodes into odd nodes T Q and even nodes T e 
and use each set as a hub for the other set. In general graphs, 
we extend this idea and have the following definition: 

Definition 1 (r-partition). Given G = (V 1 E), disjoint subsets 
Ni (i = 1, r) of V form an r-partition of G if and only if 
these two conditions both hold: (1) U[ =1 -/V,; = V, and (2) Vi, 
V\N is a hub for Ni. 

Clearly, T and T e form a 2-partition of graph Q 4 . With the 
above definition, we have 

Theorem 7. If G has an r-partition N (i = 1, r), then the 
number of measurements needed to recover k-sparse vectors 
associated with G is at most 2~2l=i ^k\N\ + r > which is 
0(rk log(n/fc)) + r. 

Proof: Note that M^ N | + 1 measurements (with one 
additional measurement for V\Ni) are enough to recover fc- 
sparse subvector associated with N via its hub V\N. ■ 
We next apply this result to the Erdos-Renyi random graph 
G(n,p), which contains n nodes and there exists an link 
between any two nodes independently with probability p. Note 
that if p > (1 + e) log n/n for some constant e > 0, G(n,p) 
is connected almost surely ll6l . 

Theorem 8. For Erdos-Renyi random graph G(n,p) with p = 
P log n/n, if 6 > 2 + efor some constant e > 0, then any two 
disjoint subsets N\ and N2 of nodes with \N\\ = \N2 \ = n/2 
form a 2-partition with high probability. Moreover, with high 
probability the number of measurements needed to recover k- 
sparse vectors associated with G(n,p) is at most 2Mf? n ^ 2 + 2, 
which is 0(2fclog(n/(2fc))) + 2. 

Proof: Let iVi be any subset of V with |iVi| = n/2, and 
let N2 = V\N\. Then Gjvi and Gn 2 are both Erdos-Renyi 
random graphs with n/2 nodes, and are connected almost 
surely when p > (2 + e) log n/n. 

We claim that with high probability, for every u € Ni, 
there exists v £ N2 such that (u, v) g E. Let Pi denote the 
probability that there exists some u € N% such that (u, v) ^ E 
for every v g N2. Then 

K = E ( J -Py^ 2 = ^ -^ogn/n)"/ 2 

n, 8 log n . giog n giogn n~ c / 2 
2 V n ' ~ 2 ~ 2 ' 



where the last inequality holds from j3 > 2 + e. Then Pi goes 
to zero as n goes to infinity, and the claim follows. Similarly, 
one can prove that with high probability for every v £ N2, 
there exists u £ Ni such that (u, v) £ E. 

Then with high probability Ni and iV2 form a 2-partition. 
The second statement follows from Theorem [7] ■ 

ifTTl considers group testing over Erdos-Renyi random 
graphs and shows that 0{k 2 log 3 n) measurements are enough 
to identify up to fc non-zero entries in an n-dimensional 
logical vector provided that p = 0(fclog 2 n/n). Here with 
compressed sensing setup and 2-partition results, we can 
recover fc-sparse vectors in lZ n with 0(2fc log(n/(2fc))) + 2 
measurements when p > (2 + e) log n/n for some e > 0. Note 
that this result also improves over the previous result in [31 1, 
which requires 0(fclog 3 n) measurements for compressed 
sensing on G(n,p). 

From Theorem [7] the number of measurements used is 
closely related to the value r. In general one wants to reduce 
r so as to reduce the number of measurements. Given graph 
G and integer r, the question that whether or not G has an 
r-partition is called r-partition problem. In fact, 

Proposition 2. Vr > 3, r-partition problem is NP-complete. 

Please refer to Appendix for its proof. We remark that we 
cannot prove the hardness of the 2-partition problem though 
we conjecture it is also a hard problem. 

B. Measurement Construction Algorithm for General Graphs 

Section IIV-AI proposes the r-partition concept as a mea- 
surement design guideline. But finding an r-partition with 
the smallest r in general is NP-hard. Now given a connected 
graph G, how shall we efficiently design a small number of 
measurements to recover fc-sparse vectors associated with G? 

One simple way is to find the spanning tree of G, and 
then use the tree approach in Section IIII-DI The depth of the 
spanning tree is at least R, where R = min^gy max„ £ y d uv 
is the radius of G with d uv as the length of the shortest path 
between u and v. This approach only uses links in the spanning 
tree, and the number of measurements used is large when the 
radius R is large. For example, the radius of Q 4 in Fig. [3] 
is rt/4, then the spanning tree approach uses at least n/4 
measurements, one for each layer. However, the number of 
measurements can be as small as 0(2fc log(n/2fc)) + 2 if we 
take advantage of the additional links. 

Here we propose a simple algorithm to design the measure- 
ments for general graphs. The algorithm combines the ideas 
of the tree approach and the r-partition. We still divide nodes 
into a small number of groups such that each group can be 
identified via some hub. Here nodes in the same group are the 
leaf nodes of a spanning tree of a gradually reduced graph. A 
leaf node has no children on the tree. 

Let G* — (V* , E*) denote the input graph. The algorithm is 
built on the following two subroutines. Leaves(G, u) returns 
the set of leaf nodes of a spanning tree of G rooted at u. 
Reduce(G = (V, E), u, H) deletes u from G and fully 
connects all the neighbors of u. Specifically, for every two 



Subroutine 1 Leaves(G, u) 
Input: graph G, root u 

1 Find a spanning tree T of G rooted at u by breadth-first 
search, and let S denote the set of leaf nodes of T. 

2 Return: S 



Subroutine 2 Reduce(G, u, H) 

Input: G = (V,E), H e for each e e E, and node u 

1 V = V\u. 

2 for each two different neighbors v and w of u do 

3 if (v,w) $ E then 

4 E = E U (v, w), H {v ^, } = H {v . u) U H^ W ) U {u}. 

5 end if 

6 end for 

7 Return: G, H 



neighbors v and u; of u, we add a link (v, w), if not already 
exist, and let Ht vw \ = Ht v u ) U H( u ,w) U { u }> where for each 
link (s,t) € S, i?( s .t) denotes the set of nodes, if any, that 
serves as a hub for s and i in the original graph G* . We record 
H such that measurements constructed on a reduced graph G 
can be feasible in G*. 

Given graph G*, let u denote the node such that 
max„ £ i/. d uv = R, where R is the radius of G*. Pick u 
as the root and obtain a spanning tree T of G* by breadth- 
first search. Let S denote the set of leaf nodes in T. With 
V*\S as a hub, we can design f(k, \S\) + 1 measurements 
to recover up to fc-sparse vectors associated with S. We then 
reduce the network by deleting every u in S and fully connects 
all the neighbors of u. For the obtained reduced network G, we 
repeat the above process until all the nodes are deleted. Note 
that when designing the measurements in a reduced graph G, 
if a measurement uses link (v, w), then it should also include 
nodes in Hr v<w \ so as to be feasible in the original graph G*. 

In each step tree T is rooted at node u where max„ 6 y d uv 
equals the radius of the current graph G. Since all the leaf 
nodes of T are deleted in the graph reduction procedure, the 



Algorithm 1 Measurement construction for graph G* 
Input: G* = (V*,E*). 

1 G = G* H e = for each e G E 

2 while | V] > 1 do 

3 Find the node u such that max„ e y d uv — R G , where 
R G is the radius of G. S =Leaves(G, u). 

4 Design f(k, \S\) + 1 measurements to recover fc-sparse 
vectors associated with S using nodes in V\S as a hub. 

5 for each u in S do 

6 G = Reduce(G, u, H) 
i end for 

8 end while 

9 Measure the last node in V directly. 
10 Output: All the measurements. 



radius of the new obtained graph should be reduced by at least 
one. Then we have at most R iterations in Algorithm [T] until 
only one node is left. Clearly we have, 

Proposition 3. The number of measurements designed by 
Algorithm Q] is at most Rf(k,n) + R + 1, where R is the 
radius of the graph. 

We remark that the number of measurements by the span- 
ning tree approach we mentioned at the beginning of Section 
IIV-BI is also no greater than Rf(k, n) + R + 1. However, we 
expect that Algorithm 1 uses fewer measurements than the 
spanning tree approach for general graphs, since Algorithm 1 
also considers links that are not in the spanning tree. And it 
is verified in Experiment 1 in Section [V] 

V. Simulation 

Experiment 1 (Effectiveness of Algorithm 1): Given a graph 
G, we consider recovering 1-sparse vectors associated with 
G. Note that Mp n — [log(n + 1)] and the corresponding 
measurement matrix has the binary expansion of i as column 
i ifTSl . Algorithm 1 divides the nodes into groups such that 
each group (except the last one) can be measured freely via 
some hub. The last group only contains one node and can 
be measured directly. The total number of measurements by 
Algorithm 1 is ri°g( n i + l)l +?> where is the number 

of nodes in group i and q is the total number of groups. 

In Fig. 6, we gradually increase the number of links in 
a graph with n = 1000 nodes. We start with a uniformly 
generated random tree, and in each step randomly add 25 
links that do not already exist. All the results are averaged 
over one hundred realizations. The number of measurements 
constructed decreases from 73 to 30 when the number of links 
increases from n — 1 to 2ri — 1. Note that the number of 
measurements is already within 3Mp n when the average node 
degree is close to 4. The radius of the graph decreases from 
13 to 7, and we also plot the upper bound in Proposition [3] 
One can see that the number of measurements constructed can 
be much less than the upper bound. 

In Fig. 7, we consider the scale-free network with Barabasi- 
Albert (BA) model J5] where the graph initially has mo 
connected nodes, and each new node connects to rn existing 
nodes with a probability that is proportional to the degree of 
the existing nodes. We start with a random tree of 10 nodes 
and increase the total number of nodes from 64 to 1024. Every 
result is averaged over one hundred realizations. One can see 
that the number of measurements constructed is proportional 
to logn, and decreases when m increases. 

Experiment 2 (Sparse Recovery Performance with Noise): 

Compressed sensing theory indicates that if A is a random 
0-1 matrix, with overwhelming probability we can recover the 
sparse vector xo though l\ -minimization |8|. Here we generate 
a graph with n = 500 nodes from BA model. Algorithm 1 
divides nodes into four groups with 375, 122, 2 and 1 node 
respectively. For each of the first two groups with size nt 
(i = 1,2), we generate [rij/2] random measurements each 




measuring a random subset of the group together with its hub. 
We also measure the two hubs directly. Each of the three nodes 
in the next two groups is measured directly by one measure- 
ment. The generated matrix A is 254 by 500. We generate 
a sparse vector xo with i.i.d. zero-mean Gaussian entries on 
a randomly chosen support, and normalize 1 1 xo 1 1 2 to 1. To 
recover Xo from y = Axo, one can run £i-minimization to 
recover the subvectors associated with the first two groups, and 
the last three entries of Xo can be obtained from measurements 
directly. However, note that every measurement for the first 
two groups passes through its hub, then any error in a hub 
measurement will affect every measurement for the group of 
nodes using this hub. To address this issue, we propose to 
use a modified l\ -minimization in which the errors in the two 
hubs are treated as entries of an augmented vector to recover. 
Specifically, let the augmented vector z = [x^,ei,e2] T and 
the augmented matrix A' — [A j3 7], where e\ (or e<z) 
denotes the error in the measurement of the first (second) 
hub, and the column vector (3 (or 7) has '1' in the row 
corresponding to the measurement of the first (or second) 
hub and '0' elsewhere. We then recover z (and thus Xo) from 
y = A'z via l\ -minimization on each group. Fig. 8 compares 
the recovery performance of our modified recovering method 
and the traditional l\ -minimization, where the hub errors e\ 
and &2 are drawn from a Gaussian distribution with zero mean 
and unit variance. For every support size k, we randomly 
generate one hundred fc-sparse vectors xo, and let x r denote 
the recovered vector. Even with the hub errors, the average 
l x r — x o||2 is within 10 -6 when Xo is at most 25-sparse by our 
method, while by £\ -minimization, the value is at least 0.5. We 
also consider the case that besides errors in hub measurements, 
every other measurement has i.i.d. zero-mean Gaussian noise. 
Let w denote the noise vector and 1 1 w 1 1 2 is normalized to 2. 
The average 1 1 x r — xo 1 1 2 here is smaller with our method than 
that with ^-minimization. 

VI. Conclusion 

This paper addresses the sparse recovery problem with 
graph constraints. By providing explicit measurement con- 
structions for different graphs, we derive upper bounds of 



the minimum number of measurements needed to recover 
vectors up to certain sparsity. It would be interesting to explore 
corresponding tight lower bounds. Further efforts are also 
needed to empirically evaluate the performance of different 
recovery themes, especially when the measurements are noisy. 

Appendix 

A. Proof of Theorem |2 

Let A mxn denote the matrix with m realizations of the n- 
step Markov chain. To prove the statement, from [8|, we only 
need to show that the probability that every 2k columns of A 
are linearly independent goes to 1 as n goes to infinity. 

Let Aj be a submatrix of A with columns in /, where / 
is an index set with |/| = 2k. Let A Sj i (1 < j < Lf|J) be 
a submatrix of Aj formed by row 2k(j — 1) + 1 to row 2k j 
of Aj. Let Pj denote the probability that rank(j4j)< 2k, and 
let 7r^ denote the probability that rank(As /)< 2k for given 
j. Note that given /, ir^ is the same for every As 1, Vj. Note 
that rank(A/)< 2k implies that rank(Ag /)< 2k for each such 
matrix As 1, then 

^1<(^) LSJ - (2) 

To characterize ir^, consider matrix B 2kx2k with E>n = 
for i — 2, 3, ...,2ft and Bij = 1 for all the other elements. 
Since rank(£?)= 2k, then 

7r^ < 1 — P(Asji is a row permutation of B). (3) 

One can check that in this Markov chain, for every 1 < i < 
k < n, P(X k = 1 I Xi = 1) > 1/2, P(X k = I Xi = 
1) > 1/4, P(X k = l\Xi = 0)> 1/2, and P(X k = 1) > 
1/2 by simple calculation. Since B has (2k)l different row 
permutations, one can calculate that 

P(A Sj i is a row permutation of B) > (2fc)!/2 4fc2+2fc ^ 1 . 

(4) 

Combining (O, (f3]l and (0J, we have 

P (every 2k columns of A are linearly independent) 
=1 - P(rank(yl/) < 2k for some / with |/| = 2k) 



where the first inequality follows from the union bound. Then 

if to = g{k)\ogn = {2k + \)2 ik2 + 2k ~ 1 logn/(2fc - 1)!, 
from © we have the probability that every 2k columns of 
A are linearly independent is at least 1 — l/((2fc)!n). Then 
the statement follows. 

B. Proof of Proposition 

Since checking whether or not r given sets form an re- 
partition takes polynomial time, r-partition problem is NP. 

We will show the r-partition problem is NP-complete for 
r > 3 by proving that the NP-complete r-coloring problem 
(r > 3) is polynomial time reducible to the r-partition 
problem. 

Let G = (V, E) and an integer r be an instance of r- 
coloring. For every [u, v) G E, add a node w and two links 
(w, u) and (w, v). Let W denote the set of nodes added. Add a 
link between every pair of nodes in V not already joined by a 
link. Let H denote the augmented graph and let V' denote the 
set of nodes in H. We claim that if there exists an r-partition 
of H, then we can obtain an r-coloring of G, and vice versa. 

Suppose Si (i = l,...,r) form an r-partition of H. Note 
that for every (u, v) € E, u and v cannot belong to the same 
set Si for any i. Suppose u and v both belong to Si for some 
i. Let w denote the node in W that only directly connects to 
u and v. If w G Si, then w has both neighbors in the same set 
with w, contradicting the definition of r-partition. If w ^ Si, 
then Hyi\Si is disconnected since w does not connect to any 
node in V'\Si. It also contradicts the definition of r-partition. 
Thus, for every (u, v) E E, node u and v belong to two sets 
Si and Sj with i ^ j. Then we obtain an r-coloring of G. 

Let Ct C V (i — 1 , . . . , r) denote an r-coloring of G. We 
claim that Ni = C t (i = 1, r - 1), and N r = C r U W form 
an r-partition of H. First note for every u 6 V, at least one of 
its neighbors is not in the same set as u since Hy is a complete 
subgraph. For every w £ W, w is directly connected to u and v 
with [u, v) E E. From the definition of r-coloring, u and v are 
in different sets C, and Cj for some i ^ j. Therefore, w has 
at least one neighbor that is not in N r . Second, we will show 
Hv'\Ni ls connected for all i, Hy\N r is m fact a complete 
graph, and thus connected. For every i < r, let Si :— V\C,, 
then V'\Ni = Si U W. is a complete subgraph, and thus 
connected. For every w £ W, since its two neighbors cannot 
be both in Cj, then at least one neighbor belongs to Si, thus 
Hv\N r = HsiUW is connected. Ni (i = 1, r) indeed forms 
an r-partition. 
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